Diabase: Towards a Diachronic BLARK in Support of Historical Studies
نویسندگان
چکیده
We present our ongoing work on language technology-based e-science in the humanities, social sciences and education, with a focus on text-based research in the historical sciences. An important aspect of language technology is the research infrastructure known by the acronym BLARK (Basic LAnguage Resource Kit). A BLARK as normally presented in the literature arguably reflects a modern standard language, which is topicand genre-neutral, thus abstracting away from all kinds of language variation. We argue that this notion could fruitfully be extended along any of the three axes implicit in this characterization (the social, the topical and the temporal), in our case the temporal axis, towards a diachronic BLARK for Swedish, which can be used to develop e-science tools in support of historical studies.
منابع مشابه
Geographic landscape and Its Usage in Historical Studies
From the early 20th century, relying on consistency of geography and history, the science of Historical Geography became subject to attention in the field of historical studies, and different theoretical schools emerged to focus on the type of attitude towards it. Geographic landscape is one of the relatively new study areas in historical geography that examines changes of the nature in a defin...
متن کاملRecent Developments in Spanish (and Romance) Historical Semantics
Diachronic semantics has long been the stepchild of Spanish (and Romance) historical linguistics. Although many studies have examined (often in searching detail) the semantic evolution of individual lexical items, Hispanists have ignored broader patterns of semantic change and the relevant theoretical and methodological issues posed by this phenomenon. Working within the framework of cognitive ...
متن کاملAssessing frequency changes in multistage diachronic corpora: Applications for historical corpus linguistics and the study of language acquisition
The use of corpora that are divided into temporally ordered stages is becoming increasingly wide-spread in historical corpus linguistics. This development is partly due to the fact that more and more resources of this kind are being developed. Since the assessment of frequency changes over multiple periods of time is a relatively recent practice, there are few agreed-upon standards of how such ...
متن کاملMultiple Tokenizations in a Diachronic Corpus
This paper deals with the construction of a maximally flexible corpus architecture for building and analyzing diachronic corpora. Historical data poses many challenges with regard to representation and analysis, and diachronic corpora are even more varied and unsystematic (Claridge, 2008). Since historical and diachronic corpora are so difficult and expensive to build, it is crucial that they b...
متن کاملUsing Comparable Collections of Historical Texts for Building a Diachronic Dictionary for Spelling Normalization
In this paper, we argue that comparable collections of historical written resources can help overcoming typical challenges posed by heritage texts enhancing spelling normalization, POS-tagging and subsequent diachronic linguistic analyses. Thus, we present a comparable corpus of historical German recipes and show how such a comparable text collection together with the application of innovative ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010